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Heisenberg's uncertainty principle is formulated for a set of generalized measurements within the 
framework of majorization theory, resulting in a partial uncertainty order on probability vectors 
that is stronger than those based on quasi-entropic measures. The theorem that results from this 
formulation guarantees that the uncertainty of the results of a set of generalized measurements 
without a common eigenstate has an inviolable lower bound which depends on the measurement 
set but not the state. A corollary to this theorem yields a parallel formulation of the uncertainty 
principle for generalized measurements based on quasi-entropic measures. Optimal majorization 
bounds for two and three mutually unbiased bases in two dimensions are calculated. Similarly, the 
leading term of the majorization bound for position and momentum measurements is calculated 
which provides a strong statement of Heisenberg's uncertainty principle in direct operational terms. 
Another theorem provides a majorization condition for the least uncertain generalized measurement 
of a given state with interesting physical implications. 



PACS numbers: 03.65.Ta, 03.65.Ca, 03.67.-a, 03.67.Mn 



I. INTRODUCTION 



Heisenberg's uncertainty principle codifies certain in- 
herent limitations on the simultaneous knowledge of ob- 
servables in the microscopic realm, and as such consti- 
tutes one of the conceptual pillars of quantum theory 
[1]. The limitations implied by the uncertainty prin- 
ciple played an important role in the celebrated Bohr- 
Einstein debates and the formative years of quantum 
theory, and have since served as a source of insight on 
the structure and behavior of microscopic systems. Hav- 
ing recognized that the non-commutativity of a pair of 
observables implies an irreducible indeterminacy in the 
simultaneous knowledge of their values, Heisenberg pre- 
sented semi-quantitative arguments to establish a univer- 
sal lower bound of the order of Planck's constant for the 
product of their uncertainties. Heisenberg's arguments 
were subsequently formulated in a mathematically pre- 
cise manner by Kennard [2] and extended by Robertson 
[3] and Schrodinger [4], all of whom adopted the square 
root of variance as the measure of uncertainty. The vari- 
ance formulation of the uncertainty principle, which is 
the one familiar from textbook accounts, is often use- 
ful when applied to canonically conjugate observables for 
estimating spectral and structural properties of microsys- 
tems. 

More than half a century later it was realized that 
entropy is a more effective measure for capturing the in- 
formation theoretical aspects of uncertainty, especially 
when applied to noncanonical observables. The key idea 
was to quantify uncertainty as the information associated 
with the probabilities of measurement outcomes rather 
than the variance in the values of the measured observ- 
able. In a seminal paper [5], Deutsch argued that such 
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a measure would be superior to the traditional one when 
dealing with observables of finite rank, and developed an 
entropic formulation of the uncertainty principle based 
on an inviolable lower bound for the sum of Shannon en- 
tropies associated with projective measurements of non- 
commuting observables. He also emphasized that, in con- 
trast to the variance formulation in its generalized form, 
the irreducible lower bound in the entropic formulation 
depends on the observables but not the state. 

In a subsequent work, this author showed that an ex- 
tension of Deutsch's formulation to infinite-rank observ- 
ables such as position and momentum, whether discrete 
or continuous, entails a nontrivial consideration of the 
resolution of the measuring device [ti] . The resulting en- 
tropic measure was a more realistic description of uncer- 
tainty for canonical observables than the variance formu- 
lation, and in contrast to the latter, was mathematically 
well defined for any state of the system and an arbitrary 
set of observables, even if infinite-rank. Consequently, 
while Deutsch's primary objective was to deal with the 
shortcomings of the variance formulation when applied to 
discrete spectra and finite-dimensional observables, the 
extension to continuous spectra such as position and mo- 
mentum turned out to be quite potent as well. 

The entropic formulation has since been refined by de- 
riving sharper bounds [7, 8] and extended by introducing 
mutually unbiased observables [8], generalized measure- 
ments [9], alternative entropy functions such as those of 
Tsallis [10] and Renyi [11], and other innovations [12]. Its 
applications include the quantum formulation of Jaynes' 
maximum entropy method and time-energy uncertainty 
relations [A-!], and more recently quantum cryptography, 
information locking, and entanglement detection [12, 14]. 
In particular, the recent applications to quantum infor- 
mation and entanglement theory clearly underscore the 
importance of basing uncertainty considerations on prob- 
ability vectors resulting from measurements rather than 
values of measured observables. It should also be noted 
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here that the idea of entropy as a measure of uncertainty 
has an interesting ear her history [15]. 

In this paper we develop a majorization formulation 
of the uncertainty principle. Majorization provides a 
partial order on probability vectors which characterizes 
the degree of their disorder, or uncertainty, and is nat- 
urally suited for application to measurement results. It 
is based on the intuitive but surprisingly powerful no- 
tion that a probability vector which is a mixture of the 
permutations of another is more disordered. This simple 
condition gives rise to a measure of uncertainty that is 
more discriminating than any measure based on a sym- 
metric, concave function defined on probability vectors. 
We shall refer to this class of measures, which includes 
the Shannon and Tsallis (but not Renyi) entropies, as 
"quasi-entropic." Thus the majorization order implies 
any quasi-entropic order, but not vise versa [16]. Indeed 
the main theorem of the present formulation directly im- 
plies the existence of a class of scalar formulations of 
the uncertainty principle for generalized measurements 
based on such quasi-entropic functions. This generaliza- 
tion serves, inter alia, to extend the standard (Shannon) 
entropic measure of uncertainty to a set of generalized 
measurements. 

It is important to note here that quasi-entropic or- 
der is total and based on a single inequality, whereas 
the majorization order is partial (i.e., not every pair of 
probability vectors can be ordered) and rests on TV — 1 
inequalities, where N is the number of nonzero entries in 
the less uncertain probability vector under comparison 
(see §IIB for details). Consequently, while the majoriza- 
tion order is equivalent to quasi-entropic ones for N <2, 
it becomes progressively more stringent with increasing 
N . It is this feature of majorization, namely matching 
the complexity of the probe to that of the object, that 
makes it particularly effective as a comparator of disor- 
der. It should therefore come as no surprise that some 
of the most fundamental results of quantum information 
and entanglement theory are based on majorization rela- 
tions [17]. We believe that its use in the present formu- 
lation likewise serves to extend the reach and power of 
the uncertainty principle. 

The rest of this paper is organized as follows: In §IIA 
we describe measurement types and establish notation, in 
§IIB we introduce the relevant elements of majorization 
theory, and in §IIC we relate concepts of measurement 
uncertainty to majorization relations. In §IIIA and IIIB 
we establish the central result of this paper. Theorem 1, 
parts A and B, and state the physical content of the un- 
certainty principle in majorization terms. In §IIIC, we 
construct the class of quasi-entropic measures of uncer- 
tainty as a corollary to Theorem 1. We apply Theorem 1 
to mutually unbiased observables in §IVA and IVB, and 
to position and momentum in §V. In §VI we establish the 
least uncertain measurement of a quantum state in Theo- 
rem 2, and state its relation to the von Neumann entropy 
and spectrum of the state. We conclude the paper with 
a few remarks in §VII. 



II. MEASUREMENT UNCERTAINTY AND 
MAJORIZATION 

We start by introducing the concepts and methods that 
underlie the majorization formulation of uncertainty in 
the following subsections. 



A. Measurements 

To establish our notation and nomenclature, we start 
by defining types of measurement. A generalized mea- 
surement is defined by a set of measurement operators 
{SHa} subject to the completeness condition X^a^a = 1, 
where = QJIJ^OJIq is called a measurement element and 
the index a identifies the possible measurement outcomes 
[Ls]. Note that each measurement element is a bounded, 
positive, self-adjoint operator whose norm cannot exceed 
unity. The probability that outcome a turns up in a 
measurement of the state p is given by the Born rule 
J'c<(/5) = tr[EQ,/5], with the post-measurement state given 
by p'^ = Map^Hjj/T'a [20]. If p is not a pure state, then 
p[^ will not in general be pure unless OTq, is of rank 1 [2 ]. 
A measurement is rank-1 if every measurement operator 
is of rank 1. Rank-1 measurements are thus seen to have 
the highest resolution among generalized measurements 
in the sense that the range of their measurement opera- 
tors consists of a single pure state. 

A generalized measurement can always be considered 
as a restriction of a more basic type of measurement, 
namely a projective measurement performed on an en- 
larged system, to the system under generalized measure- 
ment [18]. A projective measurement is commonly as- 
sociated with an observable of the system, which would 
be represented by some self-adjoint operator M. Such 
a measurement entails a partitioning of the spectrum of 
M into a collection of subsets {b^} called bins [(>]. This 
partition induces a corresponding partition of the Hilbert 
space into orthogonal subspaces with the corresponding 
projection operators being the measurement operators 
[19]. Thus for projective measurements, OT^^ — 11^^, 
where ft*^ is the projection operator onto the subspace 
corresponding to b^ . We call a projective measurement 
maximal if each bin consists of a single point of the spec- 
trum of the measured observable. Note that a maxi- 
mal measurement of an observable with a non-degenerate 
spectrum, the type usually described in textbook ac- 
counts, is rank-1. It is important to understand that 
physically realizable measurements arc limited to a finite 
set of outcomes so that any measurement of an observ- 
able with a continuous spectrum such as position or mo- 
mentum, or even an infinite discrete spectrum such as 
the energy of a harmonic oscillator, must necessarily in- 
volve infinite-rank measurement operators and cannot be 
maximal [(>]. Equivalently, only systems dcscribable by 
finite-dimensional Hilbert spaces (such as spin systems) 
admit maximal measurements. 
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B. Majorization relations 

We now turn to a brief introduction to the basics of 
majorization relations, a topic that has found important 
apphcations in quantum information and entanglement 
theory [IG, 17]. With every vector A we associate an- 
other vector A''' which is obtained from A by arranging 
the components of the latter in a descending (i.e., nonin- 
creasing) order. Then, given a pair of vectors A^ and A^, 
A^ is said to be majorized by A^ and written A^ ~< A^ if 
— for j = 1, 2, ... d, where d is the larger 

of the two dimensions and trailing zeros are added where 
needed. An equivalent but intuitively more compelling 
definition is that A^ -< A^ if and only if A^ equals a 
mixture of permutations of A^. As stated earlier, the 
majorization relation is a partial order, i.e., that not ev- 
ery two vectors are comparable under majorization. Note 
that this is not a shortcoming of majorization, but rather 
a consequence of its more rigorous protocol for ordering 
uncertainty. Indeed as mentioned in §1, for any function 
F{X) of the quasi-entropic type like Shannon and Tsal- 
lis entropies, A^ -< A^ implies F{\^) > F{\'^), but not 
conversely [!(>]. On the other hand, if for every such 
function F{\) we have F{X^) > F(A2), then A^ -< A^. 
Clearly, the majorization relation as a comparator of dis- 
order is stronger than any single quasi-entropic measure, 
and in a sense is equivalent to all such measures taken 
collectively. 

Another concept needed in the following is that of the 
infimum of a set of N vectors, defined as the vector that 
is majorized by every element of the set and in turn ma- 
jorizes any vector with that property [22]. The supremum 
is similarly defined as the vector that majorizes every el- 
ement of the set and is in turn majorized by any vector 
with that property [23]. To construct these extremal vec- 
tors, we consider a vector /x*"-^ with components 

Mo -0, M,- =mm(^^^^\ 'Z^,=i^« 

ELi^^"^^)' ^^J^d,r.a., (1) 

where dmax is the largest dimension found in the set [24] . 
The desired infimum is then given by 

A^^ = [inf(A^A^...,A^)], = M^■'-A^:-{, (2) 

where 1 < i < dmax- One can show that A™-'' as given by 
Eq. (2) is a descending sequence [25]. While the construc- 
tion given in Eq. (2) guarantees that A*"-'' is majorized by 
every element of the set, the descending property guar- 
antees that any other vector with that property is in turn 
majorized by A™^. 

A parallel construction to the above with "min" in 
Eq. (1) replaced with "max" yields a sequence that ma- 
jorizes every element of the set, but one that does not 
necessarily emerge in a descending order and may there- 
for fail to be majorized by any other sequence that has 



the same property, as required. In such a case, the se- 
quence so obtained. A, can be modified by a "flatten- 
ing" process that, while maintaining the property of ma- 
jorizing every element of the set, culminates in a se- 
quence that is descending as well. The flattening process 
starts with A, and for every pair of components violat- 
ing the descending property, i.e., A^+i > Ai, replaces the 
pair by their mean such that the updated elements are 
K = K+i = (Ai + Ai+i)/2. This process of "flattening" is 
then continued until a descending sequence correspond- 
ing to the supremum A'*"^ is obtained [2G]. 

C. Uncertainty 

We are now in a position to characterize uncertainty 
by means of majorization relations. The probability vec- 
tor y-^ (p) resulting from a measurement X on a state p is 
said to be uncertain if it is majorized by I = (1, 0, . . . , 0) 
but not equal to it. As such, T^ip) is said to be strictly 
majorized by I and written (p) I. Similarly, p 
is more uncertain with respect to measurement X than 
with respect to Y if y^ip) -< y^{p). Furthermore, we 
define the joint uncertainty of a pair of measurements 
X and Y by means of the outer product y-^ (g) T^, i.e., 

where H(-) is the Shannon entropy function, this defini- 
tion is seen to be consistent with its entropic counterpart. 
As stated earlier, -< y^ implies H{y^) > H{y^) 
but not conversely. Note that the foregoing definitions 
naturally extend to an arbitrary number of states and 
measurements. 

It is worth repeating here that the partial nature of the 
majorization order implies that not all measurement re- 
sults are comparable under our uncertainty order. While 
a total uncertainty order such as given by the entropic 
formulation would be simpler to deal with, it would also 
embody less information as an averaged out feature of a 
probability vector, especially for vectors of high disorder. 
A concomitant of partial order is that the infimum or 
supremum of a set of probability vectors will in general 
not be a member of the set. In other words, a set of 
probability vectors does not in general have a least un- 
certain element, although it may have multiple elements 
that are not more uncertain than any other probability 
vector [27]. While one may wish to do away with these 
unfamiliar features, they are nevertheless a small price 
to pay for a high-resolution comparator of uncertainty. 



III. MAJORIZATION FORMULATION OF THE 
UNCERTAINTY PRINCIPLE 

The information theoretical expression of the uncer- 
tainty principle may be stated as the requirement that 
the information available from a set of measurements 
performed on a system is subject to an irreducible level 
of uncertainty unless the measurements have a common 
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eigenstate. In the latter case, the outcome of every mea- 
surement would be certain and unique, and the informa- 
tion so obtained would be complete for the given set of 
measurements. An important additional requirement is 
that the said irreducible level of uncertainty be a property 
of the measurement set and not depend on the state of 
the system. The entropic formulation implements these 
requirements by identifying the sum of the entropies as- 
sociated with the measurements in the set as the measure 
of their joint uncertainty [5] . The majorization formula- 
tion, by contrast, relies on the outer product of proba- 
bility vectors resulting from the measurements as repre- 
sentative of their joint uncertainty. An example of this 
definition for a set of two measurements was stated in 
§IIC. The corresponding irreducible level of uncertainty 
is then defined to be the supremum of such outer prod- 
ucts as all possible states are considered. This supremum 
is then guaranteed to be less uncertain than the probabil- 
ity vector resulting from the measurement of any possible 
state of the system. As also noted in §IIC, the Shannon 
entropy of the outer product of a set of probability vec- 
tors equals the sum of their entropies, which guarantees 
that the two formulations are consistent. It also implies 
that a formulation of the uncertainty principle in terms 
of majorization implies a parallel entropic formulation, 
as will be demonstrated in §IIIC. 

In the following we will establish a theorem that em- 
bodies the above statements in a mathematically pre- 
cise manner. We will treat the cases where measurement 
elements have discrete spectra and behave similarly to 
Hermitian matrices acting on finite-dimensional spaces, 
namely compact measurement elements, separately from 
other cases such as canonical observables and continu- 
ous spectra [28]. The reason for this separate treatment 
is simply that the very notion of an eigenstate may lose 
proper mathematical meaning for non-compact measure- 
ment elements [29]. Needless to say, the physical content 
of the uncertainty principle is unaffected by these math- 
ematical details. 



A. Compact measurement elements 

Here we consider a measurement set whose elements 
are compact. Since measurement elements are bounded, 
positive, self-adjoint operators, the condition of compact- 
ness will guarantee that they only have a discreet spec- 
trum consisting of positive eigenvalues, each with a finite 
multiplicity, except possibly for a clustering of eigenval- 
ues near zero [2n]. Since we will primarily be concerned 
with eigenvalues away from zero, compact elements es- 
sentially behave as positive Hermitian matrices (of finite 
dimension) for our purposes. Note, however, that while 
finite-rank measurement elements are necessarily com- 
pact, the converse is not true. Note also that most discus- 
sions of uncertainty in the literature deal with finite-rank, 
hence compact, elements, and often just rank-1 projec- 
tion operators as in the case of maximal projective mea- 



surements. 

Theorem lA. Let {T^ (p) , (p) , . . . , V'^ (p)} be the 
set of probability vectors resulting from a set of general- 
ized measurements {X, Y, . . . , Z} with compact elements 
on the state p. Then 

<E) V^{p) (E)...(E)7^{p)^ I, (3) 

where 

unless the measurement elements {E"^, E^, . . . , E^} have 
a common eigenstate in which case J'^^® ' ®^ = L 

It is important to note that J'^^® depends on the 
measurement set but is independent of the state p. Note 
also that as the supremum of all possible measurement 
outcomes for the measurement set, it is the probability 
vector that sets the irreducible lower bound to uncer- 
tainty for the set. As such, it is analogous to variance or 
entropic lower bounds for existing formulations of the un- 
certainty principle. Unlike the scalar bounds of the vari- 
ance and entropic formulations, however, J'^^® ' ®^ is 
in general a vector quantity whose dimension is variable 
and grows with the complexity of the measurement set. 
In addition, iP^^® "®^ is in general not realizable on 
any state of the system [2.3] except under special condi- 
tions. In other words, there is generally no such thing as 
a "minimum uncertainty state" within the majorization 
framework, as alluded to in §IIC. An important exception 
to this statement is the special case of zero uncertainty 
for which J*^^®'"®^ = I, signaling the existence of a 
common eigenstate for the measurement elements as as- 
serted by Theorem lA. We will illustrate these and other 
properties for three archetypal cases in §IV and V. 

To establish Theorem lA, we need to show that 
T^^® - ®^ is strictly majorized by X, equivalently that 
its largest component is strictly less than unity, if the 
measurement elements do not have a common eigenstate. 
Suppose, on the contrary, that the largest component of 
y^^® - ®^ does equal unity while the measurement el- 
ements in the set do not possess a common eigenstate. 
Then there must exist a set of indices (a*, /?*,..., 7*) 
such that supp[T^, (/5)?J. (p) ■ ■ ■ J"^. (p)] = 1, or in terms 
of measurement elements, 

sup^[tr(Ef.p)tr(Ej.p) . . . tr(E^.p)] = 1. (5) 

Since the elements of each measurement are positive 
Hermitian operators whose sum equals the identity op- 
erator, e.g. J^cJ^a ^ 1; they must all be bounded op- 
erators with norms not exceeding unity, i.e. || Ejf ||< 1 
for every a, with similar conditions for all other mea- 
surement elements in the set. But this implies that 
tr(E^,/5) < 1, with similar conditions for all measure- 
ment elements in the set. Consequently, the only way 
Eq. (5) can be satisfied under the stated constraint on 
the norms is that (a) each measurement element is of 
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unit norm, and (b) each trace term in Eq. (5) equals 
unity, i.e., 

supp tr(E^.p, Ej.p, . . . , E^.p) = (1,1,..., 1). (6) 

Note that by probability conservation, if a subscript on 
an entry in the left-hand side of Eq. (6) is changed, the 
corresponding entry on the right-hand side must van- 
ish. Physically, Eq. (6) implies the existence of states for 
which the outcome of every measurement is essentially 
determinate, and the joint results are basically without 
any uncertainty, since the outcome of the measurement 
set {X, Y, . . . , Z} will very nearly all be events in those 
measurement "bins" that correspond to the subscript set 
(a*,/3*,...,7*). 

Mathematically, on the other hand, we note that the 
measurement elements in Eq. (6) are positive, compact 
Hermitian operators of unit norm, with discrete eigen- 
values and corresponding eigenfunctions that are com- 
plete. In addition, all nonzero eigenvalues have finite 
multiplicity. We may therefore conclude that there ex- 
ists a state p* that realizes the equalities of Eq. (6), i.e., 
that tr(E^.p*) = tr(E^./5*) = . . . = tr(E^.p*) = 1, and 
that p* is a common eigenstatc of the measurement el- 
ements {E;^. , E J. , . . . , Ey-, } with eigenvalues unity (and 
of all other elements with eigenvalue zero) [•■]()]. How- 
ever, this conclusion contradicts our starting assumption, 
thereby completing the proof of Theorem lA. 

B. Non-compact measurement elements 

In the foregoing paragraph we used the compactness 
property of the measurement elements to deduce the ex- 
istence of a common eigenstatc for them from Eq. (6). 
In the general case where non-compact elements may be 
present, the measurement elements may not even have 
properly defined eigenstates or eigenvalues, common or 
otherwise [29]. Of course the physical content of the un- 
certainty principle is still captured by Eq. (6). We will 
therefore use the physically equivalent notion of an ap- 
proximate eigenstatc in this case, defined as follows: If 
for some number a and any e > there exists a state 
■0(e) such that || {A — a)ili{e) ||< e, then ^(e) is said to 
be an approximate eigenstatc of A. 

Clearly, Eq. (6) implies the existence of an approxi- 
mate common eigenstatc for the measurement elements 
therein, thus yielding the desired result. Nevertheless, 
it is useful to develop a separate formulation and an al- 
ternative proof for this general case, especially with a 
view to deriving majorization bounds for position and 
momentum measurements in §V. 

Theorem IB. Let {y^{p),7^ {p), . . . ,y'^{p)} be the 
set of probability vectors resulting from a set of gener- 
alized measurements {X, Y, . . . , Z} on the state p. Then 
Eq. (3) of Theorem lA holds unless the measurement el- 
ements {E"^, E^, . . . , E^} have an approximate common 
eigenstatc in which case '^f^J'®'"®'^ = I- 



To establish this result, we first note that, by defini- 
tion, 

II E^ + Ej + ... + E^ ||>tr[(E^ + Ej + ... + E^)p] 

for any (normalized) density operator p. The right- 
hand side of this inequality is, by definition, equal to 
[CP^(p)-h J'J(p) + . . . + 1"^], which is a sum of non- negative 
numbers. Since the arithmetic mean of a set of non- 
negative numbers is never exceeded by their geometric 
mean, we have the inequality 

(/5) + yj(p) + . . . + y^l/n > [y^(p)J'J(/5) . . . T^]'^" 

Combining the above pair of inequalities, we arrive at 
the important conclusion that 

(p)Tj(p) . . . T^(/5) < (II E^ + Ej + . . . + E^ II /n)\ 

(7) 

where n is the number of measurements in the set. 

At this point we follow the proof of Theorem lA by 
assuming, contrary to Theorem IB, that J"^^® ' ®^ = 
T while the measurement elements do not possess 
an approximate common eigenstatc. But then there 
must exist a set of indices (a*, /3*, . . . , 7*) such that 
sup^[T^. (p)^^ (p) . . . y^, (p)] = 1. However, this equal- 
ity together with the inequality in (7) imply that 

||E^.+Ej. + ...+E^. ||=n (8) 

Since the measurement elements appearing in Eq. (8) are 
positive operators with norms not exceeding unity, we 
must conclude that they are all in fact of unit norm. 
Equation (8) further implies that for any e > 0, there 
must exist a state p* such that | 1 — tr(E/5*) |< e, where 
E stands for every measurement clement in Eq. (8). But 
this implies the existence of an approximate common 
eigenstatc contrary to our assumption, thus completing 
the proof of Theorem IB. 

We are now in a position to summarize the physical 
content of the uncertainty principle in the framework of 
majorization theory. 

The uncertainty principle. The joint results of a 
set of generalized measurements of a given state are no 
less uncertain than a probability vector that depends on 
the measurement set but not the state, and is itself un- 
certain unless the measurement elements have a common 
eigenstatc. 

In the above statement, we have dropped the qualifi- 
cation "approximate" in referring to eigenfunctions since 
this is the common practice in the physics literature as 
well as the fact that it makes little difference for the phys- 
ical content of the uncertainty principle. 

C. Quasi-Entropic formulations of uncertainty 

Theorems lA and IB immediately imply a parallel 
formulation of the uncertainty principle for generalized 
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measurements based on quasi-entropic measures. We 
recall that a quasi-entropic measure of uncertainty is 
any symmetric, concave function of the components of 
<pX®Ye - ®Z|-^'j i-gg^jting from generalized measurements 
{X, Y, . . . , Z} on the state (p) [3t]. Specifically, for every 
quasi-entropic function F, wc define a scalar uncertainty 
measure according to 

where we have normalized the measure such that it van- 
ishes when measurement results have zero uncertainty 
and is positive otherwise. 

A special class of quasi-entropic measures is obtained 
if we choose 

^[yxeYe...eZ(^)] ^^^^ /[T^(/5)Tj(p) . . . ^^(p)], 

(10) 

where / is a concave function of a single variable. Note 
that F as constructed in Eq. (10) is manifestly symmetric 
and, as a sum of concave functions, it is also concave. 

The standard entropic measure of uncertainty [5] corre- 
sponds to the choice f{x) = H{x) ~ — a;ln(a-) in Eq. (10), 
in which case the sum on the right-hand side simplifies 
to the sum of Shannon entropies for each measurement. 
This example serves to demonstrate that the class of mea- 
sures introduced in Eq. (9) is a vast generalization of the 
standard entropic measure of uncertainty, not only in the 
functional form of the uncertainty measure but also in the 
fact that it allows for any number of generalized measure- 
ments. Furthermore, as stated in the following formula- 
tion of the uncertainty principle based on quasi-entropic 
functions, the corresponding lower bounds to uncertainty 
are given by the associated majorization bound. 

Corollary 1. Let {y^{p),'J'^ {p), . . . ,y^{p)} be the 
set of probability vectors resulting from a set of general- 
ized measurements {X, Y, . . . , Z} on the state p. Then, 
for any uncertainty measure U'^®y®---®'^(^F, p) as defined 
in Eq. (9), and with as defined in Eq. (4), we 

have 

U^®y®-®^[F,p) > Ulf^®-®^ = F[T^®Y©...©Z] > 0^ 

unless the measurement elements {E"^, E^, . . . , E^} have 
an approximate common eigenstate, or a common eigen- 
state if the elements are compact. In either of these cases, 
U^®^®---®'^{F, p) vanishes. 

This general result is an immediate consequence of 
Theorems lA and IB and the quasi-entropic nature of 
the underlying measures. It is important to understand 
that the uncertainty bound given in Eq. (11) is valid, 
and in fact optimal, for the entire class of quasi-entropic 
measures. The optimality property of F['J'^®^®---®'^] is 

a consequence of the optimality of J''^^®'"®'^ , which is 
in fact the defining characteristic of the latter. As such, 
-^[■Psup^® "®^] cannot be expected to be optimal for in- 
dividual members of the quasi-entropic class such as the 
Shannon or Tsallis measures. 



As an example, we will calculate the Shannon-entropic 
uncertainty bound given by Eq. (11) for maximal projec- 
tive measurements of the three components of a spin-i 
system. The majorization bound for this measurement, 
ya^®ay®a,^ is givcu iu Eq. (16) of §IVB. Therefore, the 
desired entropic bound is given by H{'J'1up"^^^'^'), so that 
we can write 

^a.(Bay(Ba. (H, p) > H [yilf^y®" ' ) = 1.23. (12) 

The fact that a single majorization bound CP^^^® ' ®^ 
generates an uncertainty bound for the entire quasi- 
entropic class of measures is of course a consequence of 
its power as a comparator of disorder, as discussed in §1. 

In the remainder of this paper we will explore certain 
consequences of the majorization formulation of the un- 
certainty principle. Our objective will be to illustrate 
the power and reach of the majorization formulation, pri- 
marily in applications that are familiar from the tradi- 
tional variance and entropic formulations of the uncer- 
tainty principle. 

IV. MUTUALLY UNBIASED OBSERVABLES 

Our first application is to projective measurements 
of mutually unbiased observables in two-dimensional 
Hilbert spaces. 

A. Two spin-i components 

The simplest example of Eq. (3) is a maximal projec- 
tive measurement of a pair of mutually unbiased observ- 
ables in a two-dimensional Hilbert space, e.g., a mea- 
surement of ffx and (Ty on a spin-1/2 system. The state 
of a spin-1/2 system can in general be represented as 
p = (1 -|- CT • p)/2, corresponding to a polarization vec- 
tor p. The measurement elements, on the other hand, are 
E^;2 = (l±o-j;)/2 andE^''2 = (l±'^y)/2- A calculation us- 
ing these quantities gives CP°'='(/5) — \{l+p^) /2, {l—p.j.)/2\ 
and y"y{p) = [{l+py)/2, (1 -py)/2], whereby wc find 

= [(1 +p,)(l +p,)/4, (1 +p,)(l -p,)/4, 
(1 -p,)(l +p,)/4,(l -p,)(l -p,)/4]. (13) 

The next step is to find the supremum of y'^^®'^v (p) as /5, 
or cquivalcntly p, is varied. Following the construction 
of Eq. (2) et seq., wc first determine /ij"^ by finding the 
maximum value of a single component of y""®"'" {p) in 
Eq. (13), then p^^^ by finding the maximum value of the 
sum of a pair of components of J"^^®'^!' (p), and so on. 
Implementing this process, we find 

p^"^' = [0,(l + l/^/2)V4, 1,1,1], (14) 

where p"-^^ obtains for Pz —Q and | \px\ \ = \Py \= 1/a/2, 
and the next three components for p equal to a unit 
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vector along either the x- or y-axis. Using Eq. (14) and 
the counterpart of Eq. (2) for the suprcmum, we arrive 
at 

y'sZT' = [(l■5 + ^/2)/4,(2.5-^/2)/4,0,0]. (15) 

Thus a measurement of cTx and ay on any spin- 1/2 sys- 
tem will yield results no more certain than this supre- 
mum. Notice that while the supremum in Eq. (15) is less 
uncertain than is possible for any state of the system, 
it is not itself a possible probability vector in any actual 
measurement [2-3] . This is therefore an instance of a mea- 
surement where there is no "minimum uncertainty" state 
in the majorization sense, as discussed in §IIIA. 

We note in passing here that the infimum for the above 
measurement, T'^^f'^^ , is trivially realized on an unpolar- 
ized state (i.e., for p = 0) with all four components equal 
1/4. As such, it represents the state of maximum un- 
certainty for the measurement. If p is restricted to pure 
states, on the other hand, the supremum is still given by 
Eq. (15) but the infimum is found to be (1/2,1/2,0,0), 
which is realized when p is an eigenstate of one or the 
other of the two observables. 

B. Three spin-i components 

The extension of the above analysis to the case of 
three mutually unbiased observables, e.g., all three com- 
ponents of in the foregoing example, is analogous but 
requires the full machinery of the calculation of the supre- 
mum. Here the measurement set is (B <Jy (9 a^, with 
the six measurement elements given by £^="2 = (li<5'a:)/2, 
Ei'-j = (1 ± o-y)/2, and E!^^^ = (1 ± ^z)/2. The state is 
parametrized as above, so that p ~ {1 + a ■ p)/2. A 
straightforward calculation of the eight components of 
j)a.e<T„e^.(p) now gives {l±px){l±py){l±p^)/8, which 
extends the result given in Eq. (13) to three observables. 

The procedure for finding T^^®'^"®'^'' is the same as 
above, i.e., maximizing a single element of 5"^^®'^"®'^^ (/5), 
then the sum of a pair of components, and so on, fol- 
lowed by the "flattening" process described in §IIB to 
obtain a descending sequence. While doable analytically, 
this calculation is more conveniently done numerically 
[32]. Using either method, one finds for the first two 

components of /i*"^ the values p^^^ = (1 + l/VS) /8 and 

I^T^ = (1-1-1/ v^) / 4. The next two components emerge 
in an ascending order and must therefore be flattened, 
i.e., replaced by their mean. It turns out that these four 
components add up to unity, thus implying that the next 
four components vanish. The desired supremum is then 
found from p'"^P and is given by 

y^Zp""®"' (0.491, 0.238, 0.136, 0.136, 0, 0, 0, 0). (16) 

Thus any measurement of the three components of a spin- 
^ system will yield results more uncertain than the supre- 
mum given in Eq. (16), while the latter itself cannot be 
reached in any physically realizable measurement. 



As in the case of two spin components, y^^f " " 
trivially realized on an unpolarized state with all eight 
components equal. With p restricted to pure states, the 
supremum is unchanged while the infimum is found to be 
(0.250, 0.250, 0.250, 0.104, 0.062, 0.040, 0.034, 0.011). 

Needless to say, the results given in Eqs. (15) and (16) 
conform to the requirements of Theorem lA. 



V. CANONICALLY CONJUGATE 
OBSERVABLES 

Here we consider position and momentum, the archety- 
pal example of the uncertainty principle for canonically 
conjugate observables. Our objective is to calculate the 
leading component of y^!^^ for a projective measurement 
of position and momentum in one dimension since the 
knowledge of this component is sufficient to determine 
whether J'f®^ X as required by Theorem IB. As ex- 
pected, we will find that this condition is fulfilled in this 
case as well. 

A projective measurement of position in one dimen- 
sion, X, entails a set of measurement bins correspond- 
ing to intervals of the x-axis (where the detectors are 
positioned) [()]. Let [xi^a,X2,a] be the ath bin, with 
the corresponding Hilbert space projection operator, 
and similarly H^ and [pi,^,P2,^] for momentum (p) mea- 
surement [■y-')]. Note that these projection operators are 
the measurement elements for this measurement set, i.e., 
= IIq and E^ = H^. The explicit representation 
of these projection operators in coordinate space are as 
follows: 

(x I n;:^ I x') = s{x - x')e{xi,a - x)e{x - X2.a), 

{x I n^, I x') = ^ exp[zp-^(x - x')f-^^^^^^-,^^, 

(17) 

where pp = {p2.j3 + Pi,/3)/2 and Ap^ = p2,/J - Pi,p- We 
have also set fi, = 1 in Eq. (17) to simplify the writing. 

Our objective is to find the maximum value of 
T;!^®^, which by definition equals J'q(/5)TJ^(p) = 

tr(H||j/5)tr(n^p), as p is varied. Since p is a convex mix- 
ture of pure states, the desired maximum will be realized 
on pure states for which p — J- |'0)(V'|- We are thus looking 
to maximize (V'|n^|^/')(V'|n^l'/')/(^l^> by varying |?/;), or 
equivalently (?/;|. 

A variation with respect to gives the eigenvalue 
equation 

c+m = \r). (18) 

where 

£± = (n-/y-±n^/T^*)/2, (19) 

and where we have used a star to signify an opti- 
mized quantity so that T^* = (■0*|II^|?/'*)/(-0*|-0*) and 
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yP* = (■0*|fl5^|V'*)/(V'*IV'*)- The two operators t± de- 
fined above are clearly bounded and self-adjoint, with £+ 
positive as well. Furthermore, £_ has a vanishing expec- 
tation value in the optimized state by definition. 

Using Eq. (18) and the fact that (^/'^l-C- = 0, we 
find that + i-C+\il)*) = 0. If we then apply 

the definitions of CPj^* and CP^* to this equation, we arrive 
at the equality = CP^* . This equality is a consequence 
of the symmetry with respect to the x ^ p exchange in 
the above optimization problem. 

At this point we appeal to Eq. (7) of §IIIB, which 

implies that V^Pppip) < jW^l+^^f- This in- 
equality, in view of T^* ~ T^*, in turn implies that 
+ II- Comparing this to Eq. (18), 



we conclude that 



mx 



= ^1 



(20) 



Thus the eigenvalue of unity in Eq. (18) is in fact the 
maximum for the operator C^. Our task then is to find 



To that end, we left-multiply Eq. (18) by £+, use 
Eq. (18) again together with the idempotent property 
of the projection operators to eliminate all operators ex- 
cept n^n^, and rewrite the resulting equation in terms 

of |(/)*) = nj^lV'*). The result is the transformed equation 



TT^ TT^ TT^ 



{2^1 



I) 



(21) 



Thus {27^ — 1) equals the largest eigenvalue of the 
positive operator Il^n^II^, which we denote by fi^^x- 



Consequently, the desired maximum, 'J>^®p* 
equals t(1 + Pmax, 



a/3 



At this point we observe that 

II n^n^njlj ||< l since it involves a product of projection 
operators. This in turn implies that Umax 1 and conse- 
quently T^^^P* < 1 as well. Furthermore, Tl^^" = 1 is ex- 
cluded since it would imply || Ej?^ + || = 2 by Eq. (20) 
and an approximate common eigenstate for || || and 
II II which is not possible for finite x and p bins. 

To simplify the notation, we can without a loss of gen- 
erality translate the x and p axes such the two bins a 
and /? for which the optimal values above are reached 
are symmetrically centered at a; = and p ~ with 
X2,a = -xi^a = Ax/2 and p2,p = -pi,/3 = Ap/2. Using 
the representations of the projection operators given in 
Eq. (17), we can write Eq. (21) as the following integral 
equation: 

1 ,sin[Ap^(x-^ , 

dx : — (j) [X ) = ^J.„,ax4> [X), 



2tt 



-Ax/2 



{x-x')/2 



(22) 

where <j)*{x) = {x \ (j)*) is the wavefunction corresponding 
to the state \4>*) = with x and x' restricted to 

the interval [—Ax/2,+Ax/2]. It is convenient to rescale 



Eq. (22) by measuring x and x' in units of Ax. The 
resulting equation can then be written as 

1 ^"'^..^■■l-K-f')] p3, 



'J-i,f^ (i-C) 



where s = {Ax){Ap)/2TTh and ^ and ^' are restricted to 
the interval [—1/2, +1/2]. Note that h has been restored 
to the expression for s here. 

The largest eigenvalue of the integral equation (23) is 
thus equal to fi^ax- Its kernel, on the other hand, is a 
positive operator bounded by unity according to Eq. (20) 
et seq. In addition, the square of this operator has a fi- 
nite trace, which implies that the kernel belongs to the 
Hilbert-Schmidt class of operators and is therefore com- 
pact as well [2.^]. This confirms that the spectrum of 
Eq. (23) is discrete and of finite multiplicity (except pos- 
sibly for zero), confined to the interval from zero to one, 
and can only cluster around zero. Furthermore, the sum 
of the eigenvalues of the operator, which may be found 
by calculating its trace, equals s. 

The spectrum of Eq. (23) can be intuitively captured 
by considering (AxAp) as the "volume of phase space" 
and of s = (Aa;)(Ap)/27rfi, as the "number of states" as 
well as the number of (non-zero) eigenvalues. For a large 
phase space volume, there are s eigenstates with non-zero 
eigenvalues nearly equal to unity so that their sum should 
be of the order of s, which agrees with the trace of the 
kernel found above. Indeed a perturbative treatment of 
Eq. (23) for large values of s confirms this interpretation 
[34]. Indeed for s — > oo. we find, using the stationary 
phase approximation, that the kernel of Eq. (23) effec- 
tively approaches — and = H^max ^ 1- This 
is the limit of low-precision measurements with position 
or momentum bins approaching the entire span of the x 
or p space and J'^®''* 1. Therefor the measurement 
results approach (but do not reach) the limit of zero un- 
certainty for large s, albeit for measurements that yield 
correspondingly little information. The physically inter- 
esting limit is of course the opposite extreme of small 
bins and high resolution where the effects of the uncer- 
tainty principle are most strongly manifested. We will 
next consider that limit. 

For small bins and high resolution measurements, we 
expect no more than one small non-zero eigenvalue for 
Eq. (23). To verify this expectation, we observe that as 
s -^ 0, sin[s7r(e - e')]/(e - C) ^ u{0 ^ 9{1 - 4^^) 
(arbitrary normalization), and fi^ = fJ-^max ~^ where 
9 is the usual step function. Thus the largest eigenvalue 
approaches the sum of all eigenvalues, confirming that 
there is just one non-zero eigenvalue in this limit. Using 



its value, we find (T^*) 
therefor. 



(1 -I- v^) as s — >■ 0, and 



a/3 > 



(24) 



The corresponding wavefunction, 'ip*{x) = {x \ ip*), 
can be constructed by reference to the two projections 
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(j)*{x) = {x I n^lV'*) and x*{x) 



lR\tp*}. The 



first of tliese was defined above and is directly related to 
u{^) — > 6(1 — 4^^) as found in tlie foregoing paragraph. 
Tlie second, x*i^)-: can be constructed from the first us- 
ing the X ^ p symmetry mentioned above. Omitting the 
details of these steps, we can state the result as follows: 



1 



V2A: 



-.e{Ax^ - 4x^ 



Ap sin{xAp/2h) 
'AttTl {xAp/2h) 



(25) 

In view of the x ^ p symmetry mentioned above, it is 
instructive to consider the momentum-space representa- 
tion of this equation, ip*{p) = {p \ ip*), which can be 
found by Fourier transformation: 



1 



e{Ap^ - V) 



Ax sm{pAx/2h) 
Anh {pAx/2h) 



(26) 

The expected symmetry is clearly in evidence between 
Eqs. (25) and (26). It is important to remember here 
that Ax and Ap arc bin sizes and in effect represent the 
resolution of the measuring devices. They should not be 
confused with variances. 

The above results, i.e., {7''*) = {T^*) \{l + ^) 
as s — > 0, show that the state with the sharpest simulta- 
neous values of position and momentum will turn up the 
two values only 25% of the time. Note also that as a char- 
acterization of the limitations on what is knowable in a 
measurement of position and momentum, this is a more 
poignant statement of the uncertainty principle than the 
Hcisenberg inequality or the optimal entropic bound [11]. 
This is so even though we have only used the leading 
component of y^®P* for its derivation. As pointed out 
earlier, | -0*) is not a "minimum uncertainty" state in 
the usual sense as it only maximizes the leading compo- 
nent of the probability vector. The calculation of the re- 
maining terms of y^^v* in the high-precision limit, which 
will impose limitations on the readouts of more than one 
position or momentum bin at a time, is an interesting 
unsolved problem. 



VI. LEAST UNCERTAIN MEASUREMENT OF 
A STATE 

Given a state p, different measurements performed on 
it will give rise to probability vectors of varying un- 
certainty. Is there a measurement X"^ which results 
in a minimally uncertain probability vector such that 
y^{p) -< CP"^ (/5) for any measurement X? This question 
is of course only meaningful if restricted to measurements 
of comparable precision, and interesting only if applied to 
high-precision measurements such as those that are rank- 
1 , since low-precision measurements can yield probability 
vectors of arbitrarily low uncertainty. We will therefore 
look for X* among rank-1 measurements. To that end, 
we will first develop a sharpened version of Eq. (29) of 
Ref. [.35]. 



Lemma. Suppose a rank-1 measurement X is per- 
formed on a state p resulting in states {p^} with prob- 
abilities {y^}. We then have (i) ©a^^ A(/5^) < A(/5), 
where 



supx 



(27) 



and (ii) there exits a rank-1 projective measurement X"^ 
that realizes the above supremum. 

Above, A"^(-) is the spectrum of a density matrix in a 
descending order, and "©" denotes a direct sum of spec- 
tra as defined in Ref. [-V*]. The majorization condition in 
part (i) of this lemma was proved in Ref. [35]; cf. Eq. (29) 
therein. 

For the equality in part (i), as well as part (ii), we only 
need to demonstrate the existence of a rank-1 measure- 
ment that equals the right-hand side of Eq. (27), thus 
realizing the supremum. One can readily verify that 
the projective measurement defined by II* =1 a) {a |, 
where { | a) {a | } are the eigenstates of p arranged ac- 
cording to descending eigenvalues, is in fact the desired 
measurement X*. This is because with X restricted 
to rank-1 measurements, generalized or projective, ev- 
ery p'^ is pure so that X^{p'a) equals (1,0,...,0) and 
QaVaKPa) = ip) ■ For X*, On the other hand, 
y^\p) = tr(| a){a \ p) = Xi{p). Putting the last two 
equalities together, we arrive at 



Kp'J ^ ^iip), 



(28) 



showing that measurement X* realizes the supremum in 
Eq. (27). This proves the lemma, which allows us to state 
the following theorem. 

Theorem 2. The probability vector resulting from a 
rank-1 generalized measurement of a state is majorized 
by the spectrum of that state, '?^{p) ^ A(/5). Fur- 
thermore, for any quasi-entropic measure F[X{p)] < 
F[y-^{p)]. In particular, the von Neumann entropy of a 
state is the infimum of the Shannon entropy of all rank-1 
measurements of that state, i.e., S{p) < H[y-^{p)]. Fur- 
thermore, there is a projective measurement X* which 
satisfies CP-^ (p) = A(p) and saturates the stated inequal- 
ities. 

The second conclusion of Theorem 2 follows from the 
fact that A^ -< A^ implies F{\^) > F{\'^) if is a quasi- 
entropic measure (i.e., if it is a symmetric, concave func- 
tion). The choice of the Shannon entropy function H for 
F then yields the next part of Theorem 2. This part was 
also established in the first paper of Ref. [13] for projec- 
tive measurements. 

Theorem 2 provides a fundamental characterization of 
the von Neumann entropy as the greatest lower bound of 
all rank-1 measurement (Shannon) entropies. The ma- 
jorization statement of the theorem, on the other hand, 
provides an operational meaning for the spectrum of a 
quantum state (which is the set of eigenvalues of its den- 
sity operator) as the least uncertain probability vector re- 
sulting from rank-1 measurements of that state. Not sur- 
prisingly, the corresponding least uncertain measurement 
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X*, which saturates all majorization and quasi-entropic 
relations above, reproduces the pre-measurement state 
on average, i.e., in the absence of post-selection. 

Note that in the trivial case of a pure state, p =\ 
we find A(| ?/>)) = (1,0,..., 0) and V)) = I, cor- 

responding to zero uncertainty. In this case, the opti- 
mal measurement X* is given by the orthogonal pro- 
jections n| =1 I and =1 |, i = 2,3,..., 
where the states {| i)} together with | -0) form a com- 
plete orthonormal set. In case of a pure state, then, the 
least uncertain rank-1 measurement reproduces the pre- 
measurement state with certainty. This amounts to a full 
identification of the state of the quantum system, which 
embodies all available information about the system, and 
zero uncertainty. In case of a mixed state, on the other 
hand, the optimal measurement X* turns up the uncer- 
tainties that result from the impurity of the quantum 
state. In cither case, X* embodies all available informa- 
tion about the system, as would be expected of a least 
uncertain measurement of highest resolution. 

VII. CONCLUDING REMARKS 

It worth recalling here that majorization is a more 
refined comparator of uncertainty than those based on 
a scalar condition, and that the corresponding formula- 
tion of the uncertainty principle is more stringent than 
the quasi-entropic or variance formulations. We have ex- 
plained the reasons that set the majorization order apart 
from the others, and have demonstrated the consequences 
in several instances. The price of this generality is a 



mathematically more complex scheme, as one should ex- 
pect in going from a scalar to a vector formulation. Gen- 
erally speaking, one may expect the majorization formu- 
lation to be most suitable when dealing with overarching 
information-theoretical aspects of quantum systems, such 
as in Corollary 1 and §V and VI. As such, it is a useful 
complement to the entropic and variance formulations. 

In developing majorization bounds, we resorted to the 
concept of the supremum of a set of probability vectors. 
It is important to distinguish the supremum from a max- 
imal element for such a set, which would be defined as 
an element that is not majorized by any other element in 
the set. A set of probability vectors may include many 
maximal elements but no supremum, i.e., the supremum 
exists but may not be a member of the set. This is the 
reason why there is in general no "minimum uncertainty" 
state within the majorization formulation. 

In this paper we have presented a few basic ap- 
plications of the new formulation, with many others 
remaining to be worked out. A challenging case is the 
calculation of the non-leading components of T^®^* for 
position and momentum measurements. Another inter- 
esting application currently under development is the 
use of majorization uncertainty bounds for entanglement 
detection. 
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